Overview
Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 50000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.6 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 13 |
| Categorical | 6 |
age is highly overall correlated with credit_history_years and 1 other fields | High correlation |
annual_income is highly overall correlated with current_debt and 1 other fields | High correlation |
credit_history_years is highly overall correlated with age and 1 other fields | High correlation |
credit_score is highly overall correlated with loan_status | High correlation |
current_debt is highly overall correlated with annual_income and 1 other fields | High correlation |
debt_to_income_ratio is highly overall correlated with current_debt | High correlation |
interest_rate is highly overall correlated with product_type | High correlation |
loan_amount is highly overall correlated with annual_income and 2 other fields | High correlation |
loan_status is highly overall correlated with credit_score | High correlation |
loan_to_income_ratio is highly overall correlated with loan_amount and 1 other fields | High correlation |
payment_to_income_ratio is highly overall correlated with loan_amount and 1 other fields | High correlation |
product_type is highly overall correlated with interest_rate | High correlation |
savings_assets is highly overall correlated with credit_history_years | High correlation |
years_employed is highly overall correlated with age | High correlation |
defaults_on_file is highly imbalanced (69.9%) | Imbalance |
derogatory_marks is highly imbalanced (72.6%) | Imbalance |
customer_id has unique values | Unique |
years_employed has 3459 (6.9%) zeros | Zeros |
delinquencies_last_2yrs has 30797 (61.6%) zeros | Zeros |
Reproduction
| Analysis started | 2025-11-25 13:46:57.420338 |
|---|---|
| Analysis finished | 2025-11-25 13:47:23.995886 |
| Duration | 26.58 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
customer_id
Text
Unique
| Distinct | 50000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 50000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CUST100000 |
|---|---|
| 2nd row | CUST100001 |
| 3rd row | CUST100002 |
| 4th row | CUST100003 |
| 5th row | CUST100004 |
| Value | Count | Frequency (%) |
| cust100000 | 1 | < 0.1% |
| cust100047 | 1 | < 0.1% |
| cust100023 | 1 | < 0.1% |
| cust100011 | 1 | < 0.1% |
| cust100002 | 1 | < 0.1% |
| cust100003 | 1 | < 0.1% |
| cust100004 | 1 | < 0.1% |
| cust100005 | 1 | < 0.1% |
| cust100006 | 1 | < 0.1% |
| cust100007 | 1 | < 0.1% |
| Other values (49990) | 49990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 80000 | |
| C | 50000 | |
| U | 50000 | |
| S | 50000 | |
| T | 50000 | |
| 0 | 30000 | 6.0% |
| 3 | 30000 | 6.0% |
| 2 | 30000 | 6.0% |
| 4 | 30000 | 6.0% |
| 5 | 20000 | 4.0% |
| Other values (4) | 80000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 500000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 80000 | |
| C | 50000 | |
| U | 50000 | |
| S | 50000 | |
| T | 50000 | |
| 0 | 30000 | 6.0% |
| 3 | 30000 | 6.0% |
| 2 | 30000 | 6.0% |
| 4 | 30000 | 6.0% |
| 5 | 20000 | 4.0% |
| Other values (4) | 80000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 500000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 80000 | |
| C | 50000 | |
| U | 50000 | |
| S | 50000 | |
| T | 50000 | |
| 0 | 30000 | 6.0% |
| 3 | 30000 | 6.0% |
| 2 | 30000 | 6.0% |
| 4 | 30000 | 6.0% |
| 5 | 20000 | 4.0% |
| Other values (4) | 80000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 500000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 80000 | |
| C | 50000 | |
| U | 50000 | |
| S | 50000 | |
| T | 50000 | |
| 0 | 30000 | 6.0% |
| 3 | 30000 | 6.0% |
| 2 | 30000 | 6.0% |
| 4 | 30000 | 6.0% |
| 5 | 20000 | 4.0% |
| Other values (4) | 80000 |
age
Real number (ℝ)
High correlation
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.95706 |
| Minimum | 18 |
|---|---|
| Maximum | 70 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 26 |
| median | 35 |
| Q3 | 43 |
| 95-th percentile | 54 |
| Maximum | 70 |
| Range | 52 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.118603 |
|---|---|
| Coefficient of variation (CV) | 0.31806459 |
| Kurtosis | -0.43633607 |
| Mean | 34.95706 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.33586008 |
| Sum | 1747853 |
| Variance | 123.62333 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 4562 | 9.1% |
| 35 | 1713 | 3.4% |
| 34 | 1695 | 3.4% |
| 37 | 1668 | 3.3% |
| 36 | 1662 | 3.3% |
| 32 | 1628 | 3.3% |
| 33 | 1620 | 3.2% |
| 30 | 1571 | 3.1% |
| 31 | 1561 | 3.1% |
| 38 | 1534 | 3.1% |
| Other values (43) | 30786 |
| Value | Count | Frequency (%) |
| 18 | 4562 | |
| 19 | 715 | 1.4% |
| 20 | 841 | 1.7% |
| 21 | 875 | 1.8% |
| 22 | 927 | 1.9% |
| 23 | 1069 | 2.1% |
| 24 | 1138 | 2.3% |
| 25 | 1279 | 2.6% |
| 26 | 1270 | 2.5% |
| 27 | 1370 | 2.7% |
| Value | Count | Frequency (%) |
| 70 | 88 | |
| 69 | 24 | < 0.1% |
| 68 | 29 | 0.1% |
| 67 | 41 | 0.1% |
| 66 | 58 | 0.1% |
| 65 | 62 | |
| 64 | 85 | |
| 63 | 94 | |
| 62 | 99 | |
| 61 | 150 |
occupation_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Employed | |
|---|---|
| Self-Employed | |
| Student |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.9209 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Employed |
|---|---|
| 2nd row | Employed |
| 3rd row | Student |
| 4th row | Student |
| 5th row | Employed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 34971 | |
| Self-Employed | 10179 | 20.4% |
| Student | 4850 | 9.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| employed | 34971 | |
| self-employed | 10179 | 20.4% |
| student | 4850 | 9.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 60179 | |
| l | 55329 | |
| d | 50000 | |
| E | 45150 | |
| m | 45150 | |
| p | 45150 | |
| o | 45150 | |
| y | 45150 | |
| S | 15029 | 3.4% |
| f | 10179 | 2.3% |
| Other values (4) | 29579 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 446045 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 60179 | |
| l | 55329 | |
| d | 50000 | |
| E | 45150 | |
| m | 45150 | |
| p | 45150 | |
| o | 45150 | |
| y | 45150 | |
| S | 15029 | 3.4% |
| f | 10179 | 2.3% |
| Other values (4) | 29579 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 446045 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 60179 | |
| l | 55329 | |
| d | 50000 | |
| E | 45150 | |
| m | 45150 | |
| p | 45150 | |
| o | 45150 | |
| y | 45150 | |
| S | 15029 | 3.4% |
| f | 10179 | 2.3% |
| Other values (4) | 29579 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 446045 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 60179 | |
| l | 55329 | |
| d | 50000 | |
| E | 45150 | |
| m | 45150 | |
| p | 45150 | |
| o | 45150 | |
| y | 45150 | |
| S | 15029 | 3.4% |
| f | 10179 | 2.3% |
| Other values (4) | 29579 |
years_employed
Real number (ℝ)
High correlation Zeros
| Distinct | 395 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.454868 |
| Minimum | 0 |
|---|---|
| Maximum | 39.9 |
| Zeros | 3459 |
| Zeros (%) | 6.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.3 |
| median | 4.9 |
| Q3 | 11.4 |
| 95-th percentile | 23.2 |
| Maximum | 39.9 |
| Range | 39.9 |
| Interquartile range (IQR) | 10.1 |
Descriptive statistics
| Standard deviation | 7.6120967 |
|---|---|
| Coefficient of variation (CV) | 1.0210907 |
| Kurtosis | 1.262256 |
| Mean | 7.454868 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.2936126 |
| Sum | 372743.4 |
| Variance | 57.944017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3459 | 6.9% |
| 1 | 2384 | 4.8% |
| 1.7 | 641 | 1.3% |
| 1.2 | 636 | 1.3% |
| 1.5 | 617 | 1.2% |
| 1.3 | 603 | 1.2% |
| 1.4 | 598 | 1.2% |
| 1.6 | 594 | 1.2% |
| 0.7 | 587 | 1.2% |
| 0.4 | 587 | 1.2% |
| Other values (385) | 39294 |
| Value | Count | Frequency (%) |
| 0 | 3459 | |
| 0.1 | 579 | 1.2% |
| 0.2 | 516 | 1.0% |
| 0.3 | 537 | 1.1% |
| 0.4 | 587 | 1.2% |
| 0.5 | 566 | 1.1% |
| 0.6 | 528 | 1.1% |
| 0.7 | 587 | 1.2% |
| 0.8 | 533 | 1.1% |
| 0.9 | 536 | 1.1% |
| Value | Count | Frequency (%) |
| 39.9 | 4 | |
| 39.7 | 3 | |
| 39.6 | 2 | < 0.1% |
| 39.5 | 7 | |
| 39.4 | 4 | |
| 39.2 | 1 | < 0.1% |
| 39.1 | 1 | < 0.1% |
| 39 | 6 | |
| 38.9 | 3 | |
| 38.8 | 2 | < 0.1% |
annual_income
Real number (ℝ)
High correlation
| Distinct | 35770 |
|---|---|
| Distinct (%) | 71.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50062.892 |
| Minimum | 15000 |
|---|---|
| Maximum | 250000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 15000 |
|---|---|
| 5-th percentile | 15000 |
| Q1 | 27280.5 |
| median | 41607.5 |
| Q3 | 62723.25 |
| 95-th percentile | 113904.4 |
| Maximum | 250000 |
| Range | 235000 |
| Interquartile range (IQR) | 35442.75 |
Descriptive statistics
| Standard deviation | 32630.501 |
|---|---|
| Coefficient of variation (CV) | 0.65179017 |
| Kurtosis | 5.227919 |
| Mean | 50062.892 |
| Median Absolute Deviation (MAD) | 16529.5 |
| Skewness | 1.8878689 |
| Sum | 2.5031446 × 109 |
| Variance | 1.0647496 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15000 | 2644 | 5.3% |
| 250000 | 59 | 0.1% |
| 24060 | 7 | < 0.1% |
| 27578 | 7 | < 0.1% |
| 21972 | 7 | < 0.1% |
| 21307 | 6 | < 0.1% |
| 23104 | 6 | < 0.1% |
| 35518 | 6 | < 0.1% |
| 40312 | 6 | < 0.1% |
| 43931 | 6 | < 0.1% |
| Other values (35760) | 47246 |
| Value | Count | Frequency (%) |
| 15000 | 2644 | |
| 15001 | 1 | < 0.1% |
| 15005 | 1 | < 0.1% |
| 15007 | 2 | < 0.1% |
| 15008 | 2 | < 0.1% |
| 15011 | 2 | < 0.1% |
| 15014 | 1 | < 0.1% |
| 15019 | 1 | < 0.1% |
| 15024 | 2 | < 0.1% |
| 15026 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 250000 | 59 | |
| 249071 | 1 | < 0.1% |
| 248846 | 1 | < 0.1% |
| 248250 | 1 | < 0.1% |
| 247835 | 1 | < 0.1% |
| 247621 | 1 | < 0.1% |
| 246598 | 1 | < 0.1% |
| 245424 | 1 | < 0.1% |
| 245382 | 1 | < 0.1% |
| 244286 | 1 | < 0.1% |
credit_score
Real number (ℝ)
High correlation
| Distinct | 432 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 643.61482 |
| Minimum | 348 |
|---|---|
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 348 |
|---|---|
| 5-th percentile | 538 |
| Q1 | 600 |
| median | 643 |
| Q3 | 687 |
| 95-th percentile | 751 |
| Maximum | 850 |
| Range | 502 |
| Interquartile range (IQR) | 87 |
Descriptive statistics
| Standard deviation | 64.731518 |
|---|---|
| Coefficient of variation (CV) | 0.10057493 |
| Kurtosis | -0.043807213 |
| Mean | 643.61482 |
| Median Absolute Deviation (MAD) | 44 |
| Skewness | 0.012996168 |
| Sum | 32180741 |
| Variance | 4190.1695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 646 | 334 | 0.7% |
| 641 | 332 | 0.7% |
| 628 | 327 | 0.7% |
| 651 | 320 | 0.6% |
| 648 | 320 | 0.6% |
| 625 | 319 | 0.6% |
| 663 | 318 | 0.6% |
| 635 | 318 | 0.6% |
| 657 | 316 | 0.6% |
| 650 | 314 | 0.6% |
| Other values (422) | 46782 |
| Value | Count | Frequency (%) |
| 348 | 1 | < 0.1% |
| 375 | 1 | < 0.1% |
| 396 | 1 | < 0.1% |
| 401 | 1 | < 0.1% |
| 402 | 1 | < 0.1% |
| 408 | 1 | < 0.1% |
| 410 | 1 | < 0.1% |
| 414 | 1 | < 0.1% |
| 421 | 3 | |
| 422 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 850 | 43 | |
| 849 | 1 | < 0.1% |
| 848 | 1 | < 0.1% |
| 847 | 1 | < 0.1% |
| 846 | 1 | < 0.1% |
| 844 | 1 | < 0.1% |
| 843 | 8 | < 0.1% |
| 842 | 1 | < 0.1% |
| 841 | 1 | < 0.1% |
| 840 | 3 | < 0.1% |
credit_history_years
Real number (ℝ)
High correlation
| Distinct | 301 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.168274 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 302 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4 |
| Q1 | 2 |
| median | 6.1 |
| Q3 | 12.6 |
| 95-th percentile | 23 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 10.6 |
Descriptive statistics
| Standard deviation | 7.2075523 |
|---|---|
| Coefficient of variation (CV) | 0.88238376 |
| Kurtosis | 0.075125989 |
| Mean | 8.168274 |
| Median Absolute Deviation (MAD) | 4.6 |
| Skewness | 0.95375516 |
| Sum | 408413.7 |
| Variance | 51.94881 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.6 | 677 | 1.4% |
| 0.6 | 667 | 1.3% |
| 0.5 | 665 | 1.3% |
| 0.7 | 660 | 1.3% |
| 1.5 | 656 | 1.3% |
| 0.8 | 654 | 1.3% |
| 1.9 | 653 | 1.3% |
| 1.3 | 650 | 1.3% |
| 1.7 | 645 | 1.3% |
| 1.1 | 640 | 1.3% |
| Other values (291) | 43433 |
| Value | Count | Frequency (%) |
| 0 | 302 | |
| 0.1 | 623 | |
| 0.2 | 615 | |
| 0.3 | 630 | |
| 0.4 | 631 | |
| 0.5 | 665 | |
| 0.6 | 667 | |
| 0.7 | 660 | |
| 0.8 | 654 | |
| 0.9 | 618 |
| Value | Count | Frequency (%) |
| 30 | 11 | < 0.1% |
| 29.9 | 23 | |
| 29.8 | 19 | |
| 29.7 | 19 | |
| 29.6 | 27 | |
| 29.5 | 29 | |
| 29.4 | 20 | |
| 29.3 | 26 | |
| 29.2 | 17 | |
| 29.1 | 20 |
savings_assets
Real number (ℝ)
High correlation
| Distinct | 10382 |
|---|---|
| Distinct (%) | 20.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3595.6194 |
| Minimum | 0 |
|---|---|
| Maximum | 300000 |
| Zeros | 435 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 130 |
| median | 568 |
| Q3 | 2271 |
| 95-th percentile | 14963.15 |
| Maximum | 300000 |
| Range | 300000 |
| Interquartile range (IQR) | 2141 |
Descriptive statistics
| Standard deviation | 13232.399 |
|---|---|
| Coefficient of variation (CV) | 3.6801446 |
| Kurtosis | 203.38951 |
| Mean | 3595.6194 |
| Median Absolute Deviation (MAD) | 527 |
| Skewness | 12.054946 |
| Sum | 1.7978097 × 108 |
| Variance | 1.7509639 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 435 | 0.9% |
| 7 | 201 | 0.4% |
| 3 | 200 | 0.4% |
| 9 | 190 | 0.4% |
| 5 | 187 | 0.4% |
| 4 | 187 | 0.4% |
| 6 | 178 | 0.4% |
| 8 | 176 | 0.4% |
| 1 | 175 | 0.4% |
| 2 | 172 | 0.3% |
| Other values (10372) | 47899 |
| Value | Count | Frequency (%) |
| 0 | 435 | |
| 1 | 175 | |
| 2 | 172 | 0.3% |
| 3 | 200 | |
| 4 | 187 | |
| 5 | 187 | |
| 6 | 178 | |
| 7 | 201 | |
| 8 | 176 | |
| 9 | 190 |
| Value | Count | Frequency (%) |
| 300000 | 19 | |
| 290914 | 1 | < 0.1% |
| 289182 | 1 | < 0.1% |
| 283066 | 1 | < 0.1% |
| 277132 | 1 | < 0.1% |
| 273849 | 1 | < 0.1% |
| 273452 | 1 | < 0.1% |
| 270980 | 1 | < 0.1% |
| 268880 | 1 | < 0.1% |
| 268448 | 1 | < 0.1% |
current_debt
Real number (ℝ)
High correlation
| Distinct | 25350 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14290.442 |
| Minimum | 60 |
|---|---|
| Maximum | 163344 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 2018.95 |
| Q1 | 5581 |
| median | 10385 |
| Q3 | 18449.25 |
| 95-th percentile | 39815.4 |
| Maximum | 163344 |
| Range | 163284 |
| Interquartile range (IQR) | 12868.25 |
Descriptive statistics
| Standard deviation | 13243.757 |
|---|---|
| Coefficient of variation (CV) | 0.92675631 |
| Kurtosis | 9.2589019 |
| Mean | 14290.442 |
| Median Absolute Deviation (MAD) | 5732 |
| Skewness | 2.4378369 |
| Sum | 7.1452211 × 108 |
| Variance | 1.7539711 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5706 | 12 | < 0.1% |
| 4377 | 11 | < 0.1% |
| 5787 | 10 | < 0.1% |
| 3624 | 10 | < 0.1% |
| 3615 | 10 | < 0.1% |
| 5371 | 9 | < 0.1% |
| 2710 | 9 | < 0.1% |
| 3920 | 9 | < 0.1% |
| 4438 | 9 | < 0.1% |
| 9016 | 9 | < 0.1% |
| Other values (25340) | 49902 |
| Value | Count | Frequency (%) |
| 60 | 1 | |
| 64 | 2 | |
| 76 | 1 | |
| 85 | 1 | |
| 95 | 1 | |
| 103 | 1 | |
| 117 | 1 | |
| 118 | 1 | |
| 120 | 1 | |
| 122 | 1 |
| Value | Count | Frequency (%) |
| 163344 | 1 | |
| 145279 | 1 | |
| 141738 | 1 | |
| 136126 | 1 | |
| 135855 | 1 | |
| 135008 | 1 | |
| 132063 | 1 | |
| 131918 | 1 | |
| 125050 | 1 | |
| 124322 | 1 |
defaults_on_file
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 0 | |
|---|---|
| 1 | 2674 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 47326 | |
| 1 | 2674 | 5.3% |
delinquencies_last_2yrs
Real number (ℝ)
Zeros
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55464 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 30797 |
| Zeros (%) | 61.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.84504956 |
|---|---|
| Coefficient of variation (CV) | 1.5236001 |
| Kurtosis | 4.0913995 |
| Mean | 0.55464 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.8169443 |
| Sum | 27732 |
| Variance | 0.71410875 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 30797 | |
| 1 | 13043 | |
| 2 | 4399 | 8.8% |
| 3 | 1280 | 2.6% |
| 4 | 384 | 0.8% |
| 5 | 77 | 0.2% |
| 6 | 13 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 30797 | |
| 1 | 13043 | |
| 2 | 4399 | 8.8% |
| 3 | 1280 | 2.6% |
| 4 | 384 | 0.8% |
| 5 | 77 | 0.2% |
| 6 | 13 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 5 | < 0.1% |
| 6 | 13 | < 0.1% |
| 5 | 77 | 0.2% |
| 4 | 384 | 0.8% |
| 3 | 1280 | 2.6% |
| 2 | 4399 | 8.8% |
| 1 | 13043 | |
| 0 | 30797 |
derogatory_marks
Categorical
Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 758 |
| 3 | 102 |
| 4 | 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 43607 | |
| 1 | 5524 | 11.0% |
| 2 | 758 | 1.5% |
| 3 | 102 | 0.2% |
| 4 | 9 | < 0.1% |
product_type
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Credit Card | |
|---|---|
| Personal Loan | |
| Line of Credit |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 12.30224 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Credit Card |
|---|---|
| 2nd row | Personal Loan |
| 3rd row | Credit Card |
| 4th row | Credit Card |
| 5th row | Personal Loan |
Common Values
| Value | Count | Frequency (%) |
| Credit Card | 22455 | |
| Personal Loan | 17523 | |
| Line of Credit | 10022 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| credit | 32477 | |
| card | 22455 | |
| personal | 17523 | |
| loan | 17523 | |
| line | 10022 | 9.1% |
| of | 10022 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 72455 | |
| e | 60022 | |
| 60022 | ||
| a | 57501 | |
| C | 54932 | |
| d | 54932 | |
| o | 45068 | |
| n | 45068 | |
| i | 42499 | |
| t | 32477 | 5.3% |
| Other values (5) | 90136 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 615112 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 72455 | |
| e | 60022 | |
| 60022 | ||
| a | 57501 | |
| C | 54932 | |
| d | 54932 | |
| o | 45068 | |
| n | 45068 | |
| i | 42499 | |
| t | 32477 | 5.3% |
| Other values (5) | 90136 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 615112 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 72455 | |
| e | 60022 | |
| 60022 | ||
| a | 57501 | |
| C | 54932 | |
| d | 54932 | |
| o | 45068 | |
| n | 45068 | |
| i | 42499 | |
| t | 32477 | 5.3% |
| Other values (5) | 90136 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 615112 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 72455 | |
| e | 60022 | |
| 60022 | ||
| a | 57501 | |
| C | 54932 | |
| d | 54932 | |
| o | 45068 | |
| n | 45068 | |
| i | 42499 | |
| t | 32477 | 5.3% |
| Other values (5) | 90136 |
loan_intent
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| Personal | |
|---|---|
| Education | |
| Medical | |
| Business | |
| Home Improvement |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 10.2266 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Business |
|---|---|
| 2nd row | Home Improvement |
| 3rd row | Debt Consolidation |
| 4th row | Business |
| 5th row | Education |
Common Values
| Value | Count | Frequency (%) |
| Personal | 12429 | |
| Education | 10134 | |
| Medical | 7598 | |
| Business | 7469 | |
| Home Improvement | 7453 | |
| Debt Consolidation | 4917 | 9.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| personal | 12429 | |
| education | 10134 | |
| medical | 7598 | |
| business | 7469 | |
| home | 7453 | |
| improvement | 7453 | |
| debt | 4917 | 7.9% |
| consolidation | 4917 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 54772 | 10.7% |
| o | 52220 | 10.2% |
| n | 47319 | 9.3% |
| s | 39753 | 7.8% |
| a | 35078 | 6.9% |
| i | 35035 | 6.9% |
| t | 27421 | 5.4% |
| l | 24944 | 4.9% |
| d | 22649 | 4.4% |
| m | 22359 | 4.4% |
| Other values (15) | 149780 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 511330 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 54772 | 10.7% |
| o | 52220 | 10.2% |
| n | 47319 | 9.3% |
| s | 39753 | 7.8% |
| a | 35078 | 6.9% |
| i | 35035 | 6.9% |
| t | 27421 | 5.4% |
| l | 24944 | 4.9% |
| d | 22649 | 4.4% |
| m | 22359 | 4.4% |
| Other values (15) | 149780 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 511330 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 54772 | 10.7% |
| o | 52220 | 10.2% |
| n | 47319 | 9.3% |
| s | 39753 | 7.8% |
| a | 35078 | 6.9% |
| i | 35035 | 6.9% |
| t | 27421 | 5.4% |
| l | 24944 | 4.9% |
| d | 22649 | 4.4% |
| m | 22359 | 4.4% |
| Other values (15) | 149780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 511330 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 54772 | 10.7% |
| o | 52220 | 10.2% |
| n | 47319 | 9.3% |
| s | 39753 | 7.8% |
| a | 35078 | 6.9% |
| i | 35035 | 6.9% |
| t | 27421 | 5.4% |
| l | 24944 | 4.9% |
| d | 22649 | 4.4% |
| m | 22359 | 4.4% |
| Other values (15) | 149780 |
loan_amount
Real number (ℝ)
High correlation
| Distinct | 996 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33041.874 |
| Minimum | 500 |
|---|---|
| Maximum | 100000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 12300 |
| median | 26100 |
| Q3 | 48500 |
| 95-th percentile | 90900 |
| Maximum | 100000 |
| Range | 99500 |
| Interquartile range (IQR) | 36200 |
Descriptive statistics
| Standard deviation | 26116.185 |
|---|---|
| Coefficient of variation (CV) | 0.79039661 |
| Kurtosis | 0.085771957 |
| Mean | 33041.874 |
| Median Absolute Deviation (MAD) | 16300 |
| Skewness | 0.93149232 |
| Sum | 1.6520937 × 109 |
| Variance | 6.8205512 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 70000 | 2554 | 5.1% |
| 100000 | 2014 | 4.0% |
| 1500 | 597 | 1.2% |
| 50000 | 553 | 1.1% |
| 5000 | 419 | 0.8% |
| 1800 | 178 | 0.4% |
| 1600 | 150 | 0.3% |
| 2300 | 148 | 0.3% |
| 1900 | 144 | 0.3% |
| 2600 | 138 | 0.3% |
| Other values (986) | 43105 |
| Value | Count | Frequency (%) |
| 500 | 116 | |
| 600 | 41 | 0.1% |
| 700 | 39 | 0.1% |
| 800 | 42 | 0.1% |
| 900 | 44 | 0.1% |
| 1000 | 132 | |
| 1100 | 81 | |
| 1200 | 66 | |
| 1300 | 65 | |
| 1400 | 64 |
| Value | Count | Frequency (%) |
| 100000 | 2014 | |
| 99900 | 3 | < 0.1% |
| 99800 | 1 | < 0.1% |
| 99700 | 4 | < 0.1% |
| 99600 | 5 | < 0.1% |
| 99500 | 9 | < 0.1% |
| 99400 | 8 | < 0.1% |
| 99300 | 4 | < 0.1% |
| 99200 | 2 | < 0.1% |
| 99100 | 4 | < 0.1% |
interest_rate
Real number (ℝ)
High correlation
| Distinct | 1687 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.498591 |
| Minimum | 6 |
|---|---|
| Maximum | 23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 9.19 |
| Q1 | 12.18 |
| median | 15.44 |
| Q3 | 18.87 |
| 95-th percentile | 22.12 |
| Maximum | 23 |
| Range | 17 |
| Interquartile range (IQR) | 6.69 |
Descriptive statistics
| Standard deviation | 4.067942 |
|---|---|
| Coefficient of variation (CV) | 0.26247173 |
| Kurtosis | -0.98897991 |
| Mean | 15.498591 |
| Median Absolute Deviation (MAD) | 3.34 |
| Skewness | 0.019608004 |
| Sum | 774929.54 |
| Variance | 16.548152 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.47 | 68 | 0.1% |
| 11.07 | 67 | 0.1% |
| 11.43 | 61 | 0.1% |
| 11.48 | 61 | 0.1% |
| 11.2 | 57 | 0.1% |
| 11.06 | 57 | 0.1% |
| 11.18 | 57 | 0.1% |
| 11.23 | 57 | 0.1% |
| 16.57 | 57 | 0.1% |
| 11.29 | 56 | 0.1% |
| Other values (1677) | 49402 |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 6.02 | 4 | |
| 6.03 | 1 | < 0.1% |
| 6.04 | 3 | |
| 6.05 | 1 | < 0.1% |
| 6.06 | 7 | |
| 6.07 | 3 | |
| 6.08 | 4 | |
| 6.1 | 3 | |
| 6.11 | 3 |
| Value | Count | Frequency (%) |
| 23 | 15 | |
| 22.99 | 33 | |
| 22.98 | 21 | |
| 22.97 | 22 | |
| 22.96 | 32 | |
| 22.95 | 36 | |
| 22.94 | 15 | |
| 22.93 | 20 | |
| 22.92 | 33 | |
| 22.91 | 30 |
debt_to_income_ratio
Real number (ℝ)
High correlation
| Distinct | 796 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.28572416 |
| Minimum | 0.002 |
|---|---|
| Maximum | 0.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0.002 |
|---|---|
| 5-th percentile | 0.063 |
| Q1 | 0.161 |
| median | 0.265 |
| Q3 | 0.389 |
| 95-th percentile | 0.585 |
| Maximum | 0.8 |
| Range | 0.798 |
| Interquartile range (IQR) | 0.228 |
Descriptive statistics
| Standard deviation | 0.15978652 |
|---|---|
| Coefficient of variation (CV) | 0.5592335 |
| Kurtosis | -0.15115295 |
| Mean | 0.28572416 |
| Median Absolute Deviation (MAD) | 0.113 |
| Skewness | 0.5914932 |
| Sum | 14286.208 |
| Variance | 0.025531733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.221 | 155 | 0.3% |
| 0.197 | 148 | 0.3% |
| 0.224 | 147 | 0.3% |
| 0.151 | 143 | 0.3% |
| 0.169 | 143 | 0.3% |
| 0.195 | 143 | 0.3% |
| 0.269 | 140 | 0.3% |
| 0.203 | 138 | 0.3% |
| 0.236 | 135 | 0.3% |
| 0.25 | 134 | 0.3% |
| Other values (786) | 48574 |
| Value | Count | Frequency (%) |
| 0.002 | 4 | < 0.1% |
| 0.003 | 4 | < 0.1% |
| 0.004 | 7 | < 0.1% |
| 0.005 | 8 | < 0.1% |
| 0.006 | 13 | |
| 0.007 | 14 | |
| 0.008 | 8 | < 0.1% |
| 0.009 | 14 | |
| 0.01 | 17 | |
| 0.011 | 21 |
| Value | Count | Frequency (%) |
| 0.8 | 78 | |
| 0.799 | 4 | < 0.1% |
| 0.798 | 3 | < 0.1% |
| 0.797 | 2 | < 0.1% |
| 0.796 | 1 | < 0.1% |
| 0.795 | 2 | < 0.1% |
| 0.794 | 3 | < 0.1% |
| 0.792 | 6 | < 0.1% |
| 0.791 | 5 | < 0.1% |
| 0.79 | 2 | < 0.1% |
loan_to_income_ratio
Real number (ℝ)
High correlation
| Distinct | 1992 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.70199866 |
| Minimum | 0.008 |
|---|---|
| Maximum | 2.001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0.008 |
|---|---|
| 5-th percentile | 0.099 |
| Q1 | 0.333 |
| median | 0.622 |
| Q3 | 1.01025 |
| 95-th percentile | 1.624 |
| Maximum | 2.001 |
| Range | 1.993 |
| Interquartile range (IQR) | 0.67725 |
Descriptive statistics
| Standard deviation | 0.46578752 |
|---|---|
| Coefficient of variation (CV) | 0.66351625 |
| Kurtosis | -0.27296493 |
| Mean | 0.70199866 |
| Median Absolute Deviation (MAD) | 0.326 |
| Skewness | 0.65752654 |
| Sum | 35099.933 |
| Variance | 0.21695802 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1 | 991 | 2.0% |
| 0.101 | 403 | 0.8% |
| 0.099 | 402 | 0.8% |
| 0.098 | 191 | 0.4% |
| 0.102 | 186 | 0.4% |
| 0.333 | 172 | 0.3% |
| 0.387 | 70 | 0.1% |
| 0.553 | 66 | 0.1% |
| 0.447 | 65 | 0.1% |
| 0.46 | 63 | 0.1% |
| Other values (1982) | 47391 |
| Value | Count | Frequency (%) |
| 0.008 | 1 | < 0.1% |
| 0.01 | 5 | < 0.1% |
| 0.011 | 7 | < 0.1% |
| 0.012 | 15 | |
| 0.013 | 17 | |
| 0.014 | 9 | |
| 0.015 | 15 | |
| 0.016 | 13 | |
| 0.017 | 22 | |
| 0.018 | 21 |
| Value | Count | Frequency (%) |
| 2.001 | 4 | |
| 2 | 6 | |
| 1.999 | 4 | |
| 1.998 | 6 | |
| 1.997 | 2 | < 0.1% |
| 1.996 | 2 | < 0.1% |
| 1.995 | 3 | |
| 1.994 | 4 | |
| 1.993 | 4 | |
| 1.992 | 7 |
payment_to_income_ratio
Real number (ℝ)
High correlation
| Distinct | 665 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.23399494 |
| Minimum | 0.003 |
|---|---|
| Maximum | 0.667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 390.8 KiB |
Quantile statistics
| Minimum | 0.003 |
|---|---|
| 5-th percentile | 0.033 |
| Q1 | 0.111 |
| median | 0.207 |
| Q3 | 0.337 |
| 95-th percentile | 0.541 |
| Maximum | 0.667 |
| Range | 0.664 |
| Interquartile range (IQR) | 0.226 |
Descriptive statistics
| Standard deviation | 0.1552681 |
|---|---|
| Coefficient of variation (CV) | 0.66355322 |
| Kurtosis | -0.27310773 |
| Mean | 0.23399494 |
| Median Absolute Deviation (MAD) | 0.109 |
| Skewness | 0.65738493 |
| Sum | 11699.747 |
| Variance | 0.024108182 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.033 | 1583 | 3.2% |
| 0.034 | 638 | 1.3% |
| 0.111 | 251 | 0.5% |
| 0.129 | 167 | 0.3% |
| 0.176 | 156 | 0.3% |
| 0.189 | 155 | 0.3% |
| 0.156 | 152 | 0.3% |
| 0.151 | 151 | 0.3% |
| 0.118 | 149 | 0.3% |
| 0.178 | 148 | 0.3% |
| Other values (655) | 46450 |
| Value | Count | Frequency (%) |
| 0.003 | 6 | < 0.1% |
| 0.004 | 39 | 0.1% |
| 0.005 | 37 | 0.1% |
| 0.006 | 57 | |
| 0.007 | 65 | |
| 0.008 | 58 | |
| 0.009 | 70 | |
| 0.01 | 71 | |
| 0.011 | 100 | |
| 0.012 | 93 |
| Value | Count | Frequency (%) |
| 0.667 | 10 | |
| 0.666 | 12 | |
| 0.665 | 9 | |
| 0.664 | 14 | |
| 0.663 | 7 | < 0.1% |
| 0.662 | 20 | |
| 0.661 | 9 | |
| 0.66 | 22 | |
| 0.659 | 14 | |
| 0.658 | 14 |
loan_status
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.8 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 27523 | |
| 0 | 22477 |
Interactions
Correlations
| age | annual_income | credit_history_years | credit_score | current_debt | debt_to_income_ratio | defaults_on_file | delinquencies_last_2yrs | derogatory_marks | interest_rate | loan_amount | loan_intent | loan_status | loan_to_income_ratio | occupation_status | payment_to_income_ratio | product_type | savings_assets | years_employed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.310 | 0.664 | 0.359 | 0.199 | -0.008 | 0.063 | -0.106 | 0.050 | -0.179 | 0.169 | 0.001 | 0.317 | -0.032 | 0.000 | -0.032 | 0.000 | 0.458 | 0.643 |
| annual_income | 0.310 | 1.000 | 0.205 | 0.186 | 0.657 | -0.002 | 0.035 | -0.055 | 0.020 | -0.095 | 0.581 | 0.005 | 0.157 | -0.039 | 0.264 | -0.039 | 0.000 | 0.390 | 0.337 |
| credit_history_years | 0.664 | 0.205 | 1.000 | 0.236 | 0.135 | -0.002 | 0.046 | -0.066 | 0.033 | -0.119 | 0.110 | 0.000 | 0.297 | -0.022 | 0.000 | -0.022 | 0.010 | 0.627 | 0.458 |
| credit_score | 0.359 | 0.186 | 0.236 | 1.000 | 0.120 | -0.005 | 0.259 | -0.318 | 0.184 | -0.487 | 0.105 | 0.000 | 0.540 | -0.017 | 0.022 | -0.017 | 0.000 | 0.190 | 0.234 |
| current_debt | 0.199 | 0.657 | 0.135 | 0.120 | 1.000 | 0.716 | 0.018 | -0.034 | 0.013 | -0.058 | 0.381 | 0.005 | 0.085 | -0.034 | 0.169 | -0.034 | 0.000 | 0.262 | 0.218 |
| debt_to_income_ratio | -0.008 | -0.002 | -0.002 | -0.005 | 0.716 | 1.000 | 0.013 | 0.004 | 0.008 | 0.007 | -0.001 | 0.000 | 0.369 | 0.000 | 0.013 | 0.000 | 0.003 | 0.004 | -0.006 |
| defaults_on_file | 0.063 | 0.035 | 0.046 | 0.259 | 0.018 | 0.013 | 1.000 | 0.258 | 0.099 | 0.181 | 0.015 | 0.000 | 0.263 | 0.003 | 0.000 | 0.003 | 0.000 | 0.008 | 0.040 |
| delinquencies_last_2yrs | -0.106 | -0.055 | -0.066 | -0.318 | -0.034 | 0.004 | 0.258 | 1.000 | 0.042 | 0.165 | -0.033 | 0.000 | 0.321 | 0.002 | 0.000 | 0.002 | 0.000 | -0.059 | -0.066 |
| derogatory_marks | 0.050 | 0.020 | 0.033 | 0.184 | 0.013 | 0.008 | 0.099 | 0.042 | 1.000 | 0.128 | 0.010 | 0.000 | 0.227 | 0.006 | 0.010 | 0.006 | 0.000 | 0.000 | 0.031 |
| interest_rate | -0.179 | -0.095 | -0.119 | -0.487 | -0.058 | 0.007 | 0.181 | 0.165 | 0.128 | 1.000 | -0.105 | 0.000 | 0.376 | -0.064 | 0.016 | -0.064 | 0.670 | -0.095 | -0.120 |
| loan_amount | 0.169 | 0.581 | 0.110 | 0.105 | 0.381 | -0.001 | 0.015 | -0.033 | 0.010 | -0.105 | 1.000 | 0.000 | 0.084 | 0.755 | 0.266 | 0.755 | 0.359 | 0.226 | 0.235 |
| loan_intent | 0.001 | 0.005 | 0.000 | 0.000 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.192 | 0.000 | 0.000 | 0.000 | 0.000 | 0.006 | 0.000 |
| loan_status | 0.317 | 0.157 | 0.297 | 0.540 | 0.085 | 0.369 | 0.263 | 0.321 | 0.227 | 0.376 | 0.084 | 0.192 | 1.000 | 0.197 | 0.021 | 0.197 | 0.120 | 0.069 | 0.222 |
| loan_to_income_ratio | -0.032 | -0.039 | -0.022 | -0.017 | -0.034 | 0.000 | 0.003 | 0.002 | 0.006 | -0.064 | 0.755 | 0.000 | 0.197 | 1.000 | 0.236 | 1.000 | 0.410 | -0.023 | 0.048 |
| occupation_status | 0.000 | 0.264 | 0.000 | 0.022 | 0.169 | 0.013 | 0.000 | 0.000 | 0.010 | 0.016 | 0.266 | 0.000 | 0.021 | 0.236 | 1.000 | 0.236 | 0.004 | 0.029 | 0.263 |
| payment_to_income_ratio | -0.032 | -0.039 | -0.022 | -0.017 | -0.034 | 0.000 | 0.003 | 0.002 | 0.006 | -0.064 | 0.755 | 0.000 | 0.197 | 1.000 | 0.236 | 1.000 | 0.411 | -0.023 | 0.048 |
| product_type | 0.000 | 0.000 | 0.010 | 0.000 | 0.000 | 0.003 | 0.000 | 0.000 | 0.000 | 0.670 | 0.359 | 0.000 | 0.120 | 0.410 | 0.004 | 0.411 | 1.000 | 0.000 | 0.012 |
| savings_assets | 0.458 | 0.390 | 0.627 | 0.190 | 0.262 | 0.004 | 0.008 | -0.059 | 0.000 | -0.095 | 0.226 | 0.006 | 0.069 | -0.023 | 0.029 | -0.023 | 0.000 | 1.000 | 0.358 |
| years_employed | 0.643 | 0.337 | 0.458 | 0.234 | 0.218 | -0.006 | 0.040 | -0.066 | 0.031 | -0.120 | 0.235 | 0.000 | 0.222 | 0.048 | 0.263 | 0.048 | 0.012 | 0.358 | 1.000 |
Missing values
Sample
| customer_id | age | occupation_status | years_employed | annual_income | credit_score | credit_history_years | savings_assets | current_debt | defaults_on_file | delinquencies_last_2yrs | derogatory_marks | product_type | loan_intent | loan_amount | interest_rate | debt_to_income_ratio | loan_to_income_ratio | payment_to_income_ratio | loan_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST100000 | 40 | Employed | 17.2 | 25579 | 692 | 5.3 | 895 | 10820 | 0 | 0 | 0 | Credit Card | Business | 600 | 17.02 | 0.423 | 0.023 | 0.008 | 1 |
| 1 | CUST100001 | 33 | Employed | 7.3 | 43087 | 627 | 3.5 | 169 | 16550 | 0 | 1 | 0 | Personal Loan | Home Improvement | 53300 | 14.10 | 0.384 | 1.237 | 0.412 | 0 |
| 2 | CUST100002 | 42 | Student | 1.1 | 20840 | 689 | 8.4 | 17 | 7852 | 0 | 0 | 0 | Credit Card | Debt Consolidation | 2100 | 18.33 | 0.377 | 0.101 | 0.034 | 1 |
| 3 | CUST100003 | 53 | Student | 0.5 | 29147 | 692 | 9.8 | 1480 | 11603 | 0 | 1 | 0 | Credit Card | Business | 2900 | 18.74 | 0.398 | 0.099 | 0.033 | 1 |
| 4 | CUST100004 | 32 | Employed | 12.5 | 63657 | 630 | 7.2 | 209 | 12424 | 0 | 0 | 0 | Personal Loan | Education | 99600 | 13.92 | 0.195 | 1.565 | 0.522 | 1 |
| 5 | CUST100005 | 32 | Employed | 13.4 | 32015 | 570 | 7.3 | 253 | 1120 | 0 | 0 | 2 | Credit Card | Personal | 37000 | 22.92 | 0.035 | 1.156 | 0.385 | 0 |
| 6 | CUST100006 | 53 | Employed | 22.9 | 44989 | 674 | 11.1 | 19667 | 19298 | 0 | 0 | 0 | Personal Loan | Home Improvement | 45600 | 11.02 | 0.429 | 1.014 | 0.338 | 1 |
| 7 | CUST100007 | 44 | Self-Employed | 4.2 | 80603 | 625 | 18.5 | 830 | 38382 | 0 | 0 | 0 | Credit Card | Personal | 51700 | 19.42 | 0.476 | 0.641 | 0.214 | 1 |
| 8 | CUST100008 | 29 | Employed | 5.9 | 28416 | 569 | 2.6 | 1334 | 22668 | 1 | 2 | 0 | Credit Card | Education | 33800 | 22.72 | 0.798 | 1.189 | 0.396 | 0 |
| 9 | CUST100009 | 41 | Employed | 7.0 | 70717 | 638 | 21.5 | 1578 | 21394 | 0 | 1 | 0 | Credit Card | Personal | 70000 | 19.35 | 0.303 | 0.990 | 0.330 | 1 |
| customer_id | age | occupation_status | years_employed | annual_income | credit_score | credit_history_years | savings_assets | current_debt | defaults_on_file | delinquencies_last_2yrs | derogatory_marks | product_type | loan_intent | loan_amount | interest_rate | debt_to_income_ratio | loan_to_income_ratio | payment_to_income_ratio | loan_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 49990 | CUST149990 | 37 | Employed | 9.4 | 37218 | 588 | 11.7 | 2389 | 13416 | 0 | 0 | 0 | Credit Card | Personal | 10600 | 21.35 | 0.360 | 0.285 | 0.095 | 0 |
| 49991 | CUST149991 | 26 | Employed | 0.7 | 22159 | 550 | 3.7 | 1174 | 12699 | 0 | 0 | 0 | Credit Card | Personal | 24600 | 21.52 | 0.573 | 1.110 | 0.370 | 0 |
| 49992 | CUST149992 | 30 | Employed | 9.3 | 24797 | 610 | 8.3 | 16 | 6282 | 0 | 1 | 0 | Personal Loan | Medical | 32100 | 14.98 | 0.253 | 1.295 | 0.432 | 0 |
| 49993 | CUST149993 | 53 | Employed | 18.1 | 36948 | 676 | 2.6 | 59 | 11421 | 0 | 0 | 1 | Credit Card | Personal | 2400 | 18.06 | 0.309 | 0.065 | 0.022 | 1 |
| 49994 | CUST149994 | 18 | Employed | 0.0 | 64343 | 585 | 1.3 | 277 | 27914 | 0 | 0 | 2 | Personal Loan | Personal | 41600 | 15.42 | 0.434 | 0.647 | 0.216 | 0 |
| 49995 | CUST149995 | 35 | Employed | 4.3 | 39449 | 570 | 16.3 | 1127 | 7576 | 0 | 0 | 0 | Credit Card | Education | 42800 | 21.31 | 0.192 | 1.085 | 0.362 | 0 |
| 49996 | CUST149996 | 34 | Employed | 4.4 | 20496 | 672 | 12.6 | 1478 | 6276 | 1 | 0 | 0 | Credit Card | Personal | 3800 | 18.07 | 0.306 | 0.185 | 0.062 | 0 |
| 49997 | CUST149997 | 41 | Self-Employed | 4.8 | 18743 | 719 | 10.1 | 17 | 10331 | 0 | 0 | 0 | Credit Card | Personal | 18000 | 17.45 | 0.551 | 0.960 | 0.320 | 0 |
| 49998 | CUST149998 | 38 | Student | 0.4 | 17250 | 633 | 1.3 | 5 | 7779 | 0 | 0 | 1 | Personal Loan | Personal | 1400 | 14.71 | 0.451 | 0.081 | 0.027 | 0 |
| 49999 | CUST149999 | 53 | Employed | 17.2 | 46923 | 695 | 15.0 | 8332 | 2655 | 0 | 1 | 0 | Personal Loan | Personal | 41000 | 12.39 | 0.057 | 0.874 | 0.291 | 1 |